A Diachronic Approach for Schwa Deletion in Indo Aryan Languages

نویسندگان

  • Monojit Choudhury
  • Anupam Basu
  • Sudeshna Sarkar
چکیده

Schwa deletion is an important issue in grapheme-to-phoneme conversion for IndoAryan languages (IAL). In this paper, we describe a syllable minimization based algorithm for dealing with this that outperforms the existing methods in terms of efficiency and accuracy. The algorithm is motivated by the fact that deletion of schwa is a diachronic and sociolinguistic phenomenon that facilitates faster communication through syllable economy. The contribution of the paper is not just a better algorithm for schwa deletion; rather we describe here a constrained optimization based framework that can partly model the evolution of languages, and hence, can be used for solving many problems in computational linguistics that call for diachronic explanations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Aspect shifts in Indo-Aryan

The grammaticalization literature notes the cross-linguistic robustness of a diachronic pattern involving the aspectual categories resultative, perfect, and perfective. Resultative aspect markers often develop into perfect markers, which then end up as perfect plus perfective markers. We introduce supporting data from the history of Old and Middle Indo-Aryan languages, whose instantiation of th...

متن کامل

Aspect shifts in Indo-Aryan and trajectories of semantic change1

The grammaticalization literature notes the cross-linguistic robustness of a diachronic pattern involving the aspectual categories resultative, perfect, and perfective. Resultative aspect markers often develop into perfect markers, which then end up as perfect plus perfective markers. We introduce supporting data from the history of Old and Middle Indo-Aryan languages, whose instantiation of th...

متن کامل

Dialects in the Indo-Aryan landscape

The Indo-Aryan language family currently occupies a significant region of the Indian subcontinent, its member languages being spoken in the bulk of North India, as well as in Pakistan, Bangladesh, Nepal, Sri Lanka, and the Maldives. The historical depth of the textual record and the geographical breadth of the Indo-Aryan linguistic area, the diversity of its languages (226 in all), and its many...

متن کامل

The Relationship between Case Marking and S, A, and O in Spoken Sinhala

1. INTRODUCTION. In this paper I examine the relationship between case marking and S, A, and O in spoken Sinhala. I will demonstrate that case roles are not assigned on the basis of grammatical relations, but rather they depend on a series of semantic and lexical principles including volitivity, animacy, semantic roles, and definiteness. This paper will furthermore provide evidence for S, A, an...

متن کامل

Why Indo-Aryan languages adapt English alveolars as reʈroflexes: Acoustic evidence from Punjabi

In Indo-Aryan languages, English loanwords containing the alveolar /t/ are always adapted as retroflex /ʈ/ [1]. It is argued that English alveolars share the cues of release burst with the retroflexes in Indo-Aryan languages [2]. However, no quantitative acoustic evidence is provided by [2] as to what acoustic cues of English alveolars are important for the speakers of Indo-Aryan languages to a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004